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Abstract — In this article we present an Elitism Levels Traverse Mech- 
anism that we designed to find bounds on population-based Evolutionary 
Algorithms solving unimodal functions. We prove its efficiency theoreti- 
cally and test it on OneMax function deriving bounds c/xn log n — 0{fm). 
This analysis can be generaUzed to any similar algorithm using variants 
of eUtist selection and genetic operators that flip or swap only 1 bit in 
each string. 

Index Terms — Evolutionary computation, Genetic algorithms, Compu- 
tational complexity 

I. Introduction 

We analyze an elitist population-based Evolutionary Algorithm with 
population size fi and recombination pool size A, (n + A)EA using 
a genetic operator 1 -Bit-Swap that recombines information between 
parents (see Hi). 

Most research in theoretical EA community is focused on 
mutation-based single species algorithms such as (1 + EA 
(see e.g. (2|-|4|) with some sharp bounds on runtime obtained for 
OneMax function such as 0.982nlogn in (4|. 

Results on population-based algorithms are less abundant, and 
are restricted to mostly (/i + l)xEA (see ||5J) with upper bound 
0{fj,n + nlogn) and (1 + A)j.EA (see |6|, |7)) with upper 
bound on OneMax 0(n log n + nX) in and all linear functions 

Although so far (/i + A) or (A^ + A*') EAs have deserved less 
attention, they have been the subject of analysis in |8|-|10|. 
Specifically, in 1101 it was derived that for a (A'' + A'') EA with 
mutation and tournament selection solving OneMax the upper bound 
is 0{nN log N + nlogn) if measured in the number of function 
evaluations. 

Unfortunately many of these results are not directly comparable 
due to the difference in selection functions (fitness-proportional, 
truncation, elitist, tournament, etc) and elitism settings (save 1 best 
species or some variable proportion). 

Even more significantly, it was shown already in |8| that population 
effect is generally problem-specific, so it is quite hard to generalize 
findings to other functions. There is ample evidence though (e.g. 
(5), (61) that for mutation-based algorithms (incl. Randomized 
Local Search, RLS) optimizing simple functions such as OneMax 
population is not beneficial and tends to degrade performance. 



II. Algorithms and Problems 

A. Algorithm 

Although the mechanism described in this paper is quite universal, we 
test it on (jj, + X)EAibs solving OneMax problem. This problem 
is well-known in EA community, recent achievements include l-^, 
liT IJ with some sharp bounds. We selected this problem due to its 
simplicity and the ability to compare our findings to those available 
already. 



TABLE I 
Algorithm 



(/i + A) Evolutionary Algorithm u.sing 1-Bit-Swap (IBS) 


1 


create /i starting species at random 


2 


t = 


3 


loop 


4 


select using a variant of fitness-proportional Tournament selection 




^ pairs of parents into the pool 


5 


swap bits in each pair 


6 


keep the currently-elite species in the population, replace the rest 




with the pool, first with new cun'ently-elite, then at random 


7 


t = t + l 


8 


end loop 



B. Selection function 

Throughout the article we analyze an elitist recombination-driven 
{fi + A) EA using a variant of tournament selection. It is both simple 
to implement and analyze. But since we recombine information 
between parents, we are interested in forming pairs of species in 
the recombination pool, and on the construction of these pairs the 
properties of the algorithm will be derived. This formation occurs in 
the following way: 

TABLE II 
Selection Function 

Variant of Tournament Selection 

2 loop 

3 select two species from the population at random 

4 examine their fitness, the better one enters the pool 

5 k = k + 1 

6 end loop 



Thus it is obvious that better-fit species have higher chances of 
entering the pool, so we can expect the proportion of a species to 
be higher in the pool rather than in the population. 



C. 1-Bit-Swap Genetic Operator 

We apply the 1-Bit-Swap operator that was found to be useful 
solving a large number of test problems in UJ and was analyzed 
extensively in 1121 . 1131 to have outperformed the mainstream RLS 
algorithm both theoretically and numerically. 

Another advantage of IBS is that we can compare it directly 
to RLS, since both are local search operators that cannot move too 
far from the current best search point. The operator works in the 
following way: 

TABLE III 
Genetic Operator 

1-Bit-Swap Operator 

2 loop 

3 select a bit in the first parent uniformly at random 

4 select a bit in the second parent uniformly at random 

5 swap values in these bits 

5 i = j + 1 

6 end loop 



IIL Definitions 

A. Fitness levels partition 

Basic approach to analyzing elitist EA with a simple 1-bit mutation 
solving unimodal binary-encoded EAs was introduced by Wegener 
in 1141 that is based on fitness partitioning: on a set of binary strings 
{0, 1}" size 2" a partition into a finite number of nonempty subsets 
A\,A2,--- ,Am is defined with ordering A\ ^ A2 ■ ■ ■ ^ Am 
s.t. all a £ Am are the global optimum. 

This approach allows definition and derivation of the lower bound of 
success probability of transition between states, Si{a) = P{Aij^\\Ai) 
and the upper bound on expected convergence time of the 
algorithm, expected first hitting time of the best fitness level, 
E{Xf) <s-^ + s:^^... + s-^_i. This idea can be extended to the 
situation when we apply non-negative weights wi^f) (see |2|, [141) 
and to derive lower bounds (by considering the upper bound on Si{a). 

Another tool used extensively in the analysis of EAs are potential 
(auxiliary) functions that measure progress (see 1141 ). This is 
especially useful when working on functions that have fitness 
plateaus (see e.g. 1151 ). in which case we make the difference: 

1) fitness functions decide whether the new binary input (species) 
is better than the old one 

2) potential function tracks the progress between states of the 
algorithm (fitness levels) 

OneMax (or some simple transformation of it) is used as a poten- 
tial function for more complicated problems (Royal Roads, Binary 
Values, Short/Long Path etc). 

B. Elitism Levels Partition 

In this article we extend this approach to a population-based elitist 
algorithm, but rather than tracking the traverse of levels of fitness, 
we do the same to the levels of elitism, i.e., number of elite species 
in the population. 

We focus on species that can either evolve to the currently- 
best over 1 iteration or are already best. Therefore, the population 



is broken down into three disjoint subsets: 

a : currently best species 
/?! : species with next-best fitness 

: the rest of the population that cannot evolve over 1 generation 

Since IBS swaps exactly 1 bit between two parents, this partition in 
combination with the assumptions made above enables construction 
of a very precise model, since the value of a cannot 'jump' more 
than 1 level of fitness and only a-species can breed better population, 
but only /3i -species may evolve into a and change the probability of 
evolution. 

C. a-levels subpartition 

This additional partition is necessary for functions with plateaus for 
which we use potential functions explained above. The need for it 
becomes evident in the next section, when probabilities of evolving 
elite species on two types of functions are compared. In addition to 
the elitism levels partition, for functions with plateaus we need to 
subpartition the a-level. 

In slight abuse of notation in the rest of this article, we denote A 
the set of chromosomes in the population with the highest fitness. 
Also 7 is the length of the plateau of fitness. Therefore the set A 
can be partitioned into 

A = AoUAiU---U Aj-i 

where each subset Am has equal fitness. In order to differentiate 
between Am, we assign each elite species an additional auxiliary 
function, Vm that tracks progress to the next level by counting the 
number of 1-bits in the fitness level: Aq Ai ^ A-,-i with 

corresponding auxiliary values Vo < Vi . . . < V7-1, i.e. OneMax is 
used as an auxiliary function. Species with both highest fitness and 
auxiliary values can be viewed as super-elite or a*. 

In the next section we use the notation a, a* to denote the 
set of elite or super-elite species, an element of that set and the size 
of it. This is done to reduce notational clutter. 

IV. Elitism Levels Traverse Mechanism for Upper 
Bounds 

In this section we present the main result of the article on a general 
function that is later confirmed by further application to OneMax 
Test Function. We are interested in the upper bounds on optimization 
time (for explanation of Landau notation see e.g. Chapter 9 in [16J). 

The working of the Elitism Levels Traverse Mechanism can 
be illustrated by an example from immunology. 

There exists a population of species size TV, which is susceptible to 
M types of infection, which are mutually exclusive, i.e. a species 
cannot be infected by more than one infection. The size of each 
set of infected species cannot be larger than ruj. We denote E* 
an event that there are 1 < r < nij infected species of type j, 
of which exactly one spawns an infected offspring that destroys a 
healthy member of the population. Since the sets of infected species 
are mutually exclusive, by additivity we obtain the probability that 
any of the infected species adds exactly one infected offspring: 

^j = l ' ^j = lr = l ' j = l r = l j=l 



This expression is quite complicated for a number of reasons, e.g. the 
knowledge of rrij. Although we can find bounds on the partial sum 
of rows of Pascal triangle, it is guaranteed to make the derivation 
quite messy. Therefore we need to lower-bound this probability. We 
do this by considering only one infected species of each type rather 
than r and the event of spawning exactly one infected species by 
Ej. This gives us the lower bound on the total probability of adding 
exactly one infected offspring, which is proven in Appendix IaI 

M M 

p{e;) > p{Ej) ^ J2 p(E^) ^ E p(E^) (1) 

In the notation of EA, = . the number of pairs of parents in the 
recombination pool with parents that are able to produce exactly one 
elite offspring. (for < 5 < 1) is the number of elite individuals 
in the population that, once it is reached, the probability to generate 
an offspring with higher fitness is arbitrarily close to one, i.e. 1 — o(l). 
We also have n — 1 levels of fitness. Combining this with the upper 
bound on the probability of adding elite offsprings to the population, 
we obtain the upper bound (worst-case) on the optimization time of 
the algorithm: 



Er = O 



EE- 



(2) 



Derivation of the upper bound from Equation |2] is rather versatile. 
We need to identify pairs of possible parents < pi,p2 > such 
that there exists some probability of swapping bits between parents 
VPj(fc) > that as a results of applying a genetic operator to this 
pair either a new a species evolves from lower-ranked ones or an 
existing a is preserved after the recombination. 

Intuitively, for the functions with plateaus both the population 
size and the number of elite species are more important than for 
those without plateaus. In the remainder of this section we show 
that the probability to add a super-elite offspring when solving a 
function with plateaus is less than the probability to add an elite 
offspring when solving functions without plateaus. 

For the rest of this section we denote /i function without 
plateaus and /2 function with plateaus. What we show is that 

A. Functions without plateaus 

For this type of unimodal functions (e.g. OneMax) intuitively it is 
easier to add an elite offspring and thus reduce the optimization 
time, but we need to show it rigorously. 

The probability to select a pair with an a-parent can be bounded by 



p/i 



where is the probability to select a non-elite species to be paired 
with the elite one. Also bound the probability to flip the bits ipj (k) > 
f] V j, k. So the probability of an event Ej that includes pairs with 
elite species is 



P{E: 



The probability to select a pair without the the currently-elite species 
is lower-bounded by By breaking down the set of parents M/j 
in the recombination pool into those including a— parents, Mj^ 



and those that do not, Mf*, we can find the lower bound on the 
probability of adding another elite species: 



P(St 



2fi 2n 
B. Functions with plateaus 

As noted in 1 10], algorithms with well-chosen population size 
perform similar to, and best individuals evolve along the same path 
as (1 + 1)EA. The difference between (fi + A)and (1 + 1) lies in 
the cost of traversing plateau. For this type of functions the length 
of plateau 7 > 1. So we have K plateaus w.l.o.g. of the same length 
7 G Z^, and n = 'jK. 

Also we assume that at the start of the algorithm each 'bin' 
(plateau) starts with an equal number of I's and O's uniformly 
distributed, therefore fitness of the best species at the beginning of 
the run is 0. To track progress between jumps in fitness values we 
use OneMax as an auxiliary function (roughly along the lines of 
using potential or distance functions, see e.g. (7]) that sums bits in 
the plateau. 

The tricky part in this analysis is that the selection is based 
on fitness of the string rather than auxiliary function, but the 
progress towards the next level of fitness plateau depends on the 
number of parents with highest auxiliary value, Vs rather than /(s). 
By denoting the subset of a with highest auxiliary function a* , we 
notice that f{a) = f{a*), and Va < Va*. Also trivially a* < a 
(for the case of functions without plateaus these functions are 
identical and last two expressions are equalities). 

As shown before, for a unimodal function without plateaus 
regardless of fitness function, the probability that one of the parents 
is elite is ^, since if two elite species are selected for breeding, 
parent is chosen randomly. Obviously — < — . Additionally, 



2 



< - 



Obviously, unlike /i, for the evolution process on /2 only a 
small subset of parents are of use, these having the highest and 
next-highest auxiliary values. Therefore pairs that do not include 
at least 1 of these parents can't add an a* offspring. Similar to 

/i, = M}^+M}* and clearly M}^ < M}^ and M}* < M}*. 

Along the lines of arguments in the previous subsection, 3^2 
s.t. probability to select a non super-elite parent in addition to the 
super-elite one is upper-bounded by it. We get: 



sel — 



a* {a — a*)£,2 



so the probability of an event that an a* parent is added to a pool 
and a new a* offspring evolves is upper-bounded by 

^ 3 ' - 4^*2 

where r) is the lower bound on the probability of swapping bits. 
Therefore the probability to add one more a* species to the popula- 
tion is 



P{S, 



+ 



-ip? ' 2p 

ive, and taking Mjf * > Ai 
e < 1, we compare the values in the first and second fractions 



Combining the inequalities above, and taking Mjf* > Mf* + j^, < 



in the expressions for P{S^^i.) and P{S^^f.). It is easy to see that 
3 two constants tp2 < '^i s.t. 



fl 



V. Upper Bound on Runtime of + \)EAibs on OneMax 

TEST function USING ELITISM LEVELS TRAVERSE MECHANISM 

In this section we present our findings on the upper bounds on runtime 
of {fj, + A)EA with 1-Bit-Swap operator optimizing OneMax func- 
tion using the Elitism Levels Traverse Mechanism. We distinguish 
four pairs of parents that make possible evolution of currently-elite 
species: 

El :< a, Pi > 
E2 :< /3i,/3i > 
E3 :< a,/?_i > 
E4 :< > 



We do not consider the obvious pair < a, a > as it either adds 
two elite offsprings, of generates an offspring with higher fitness, 
something we do not use in the Mechanism. 

For the upper bound on optimization time we only consider 
increase of the number of elite species by at most one. Increase 
by two or more is ignored, or otherwise transformed into any of 
the lower-ranked species. Similar approach was used in [ 10] in 
bounding the takeover time. 

A. Simple upper bound 

Of these four cases we start analysis with the first two. Main reason is 
that the other two cases involve cubic function, which becomes quite 
complicated to solve (see next subsection). For the cases El , E2 we 
get the following probabilities of success: 



P{Ei 



2( 2Li(fc).^.^l 1 
y.Pi\{p — a)ifii{k) 



PiE2)^{l^Mk)[^ 



The probability of at least 1 of these events is 

2(pi{k)aPiX{fj, — a) 



P{Sc,.k) > P{Ei) + P{E2) = 



+ 



ip2{k)\l3fipi{k){fi - a) 



and, since P{S) is minimal, the upper bound on expected time to 
traverse levels of elitism large enough to get a 1 — o(l) probability 
of evolution is 



The expression for the expected first hitting time we obtain as a result 
of this setup is 



(3) Ef,,fe < 2/1^ 



^ PiX{fi - a){2afi(fii{k) - aPnp2{k) + ^i/xip2(fe)) 



E 



(V52(fc) - 2At(pi(fc))a2 + {2^j?ipi{k) - 2n<fi2{k))a + p^M^) 



Sjj, 



1 



X{(p2{k) — 2p(fii{k)) ^ q2 + bia + bo 
where 

fi^(p2ik) 



(5) 



bo = 
bi = 



tp2{k) - 2^iipi(k) 
2/i(/i(^i(fe) - V'2(fc)) 



((52 (fc) — 2fllfil{k) 

At this point we set /3i pessimistically to 1 to simplify the derivation. 
This a quadratic equation in a. The full solution to Equation |5] is in 
Appendix IbI 



The optimization time is 



(6) 



for some constant 62(1-1) ■ For the second option of S — the upper 
bound becomes 

ET(t.+x)EAiBs = Oi/J-nlogn) (7) 
or, in the number of function evaluations 

ET(f.+A)i5AiBs = C>(A/inlogn) (8) 
B. Refined upper bound 

We add the other two cases to obtain a sharper upper bound on 
optimization time, we set /3i = 1: 

-'-''-(l)7('-?)('-^)-"> 

The probability to evolve one more elite species is {P(Ei), P{E2) 
are the same as in the previous derivation): 

PiSc,k) = P{Ei) + P{E2) + PiEs) + P{Ei) 

and the expected time until there are 5^ elite strings in the population: 

6 ^ 



Sct.k 



= 2m^E' 



bzQ'^ + b2a^ + bia + feo 



where 

61 = 2\fi{fiipi{k) + fi'^ip-iik) — 3fiip4{k) — ip2{k)) 

62 = X{ifi2{k) — 4^^v53(fc) — 2^i(pi{k) — 6^i(p4.{k)) 

63 = 2A(pip3(fc) - V'4(fc)) 

Full solution of Equation |9] is in Appendix Icl 

The upper bound on expected optimization time is (for S 7^ ^ j c is 
a constant): 

c^^"'''^n log n 



Er, 



(M + A)BAibs 



A 



0(^1 (10) 



A 



for < e(/i) < 1 and if S = ^, in the number of function 
evaluations: 



ET(fi+A)i5AiBs = cfinlogn- 0{fin) 



(11) 



This bound is sharper than the one obtained using simpler arguments 
earlier in this paper up to the order A (since more possibilities of 
adding elite species are considered). It is also comparable to the 
results in |6|, |7|, [10] (see below). Such a result likely means that 
population has positive effect for some relatively small fi, but as it 
keeps increasing it either levels out (at best) or starts to degrade 
performance. 

C. Generations vs Function evaluations 

Tournament selection has a property that you do not need to evaluate 
every species, but we need to make 2A evaluations (since two species 
compete for 1 slot in the recombination pool, so the number of evalu- 
ations each generation is 0(A). Therefore, in terms of the number of 
functions evaluations the rough bound becomes O(^Anlogn) and 
the refined one 0{^n\ogn). If/i = A = 0(l) this reduces to 
the well-known result of 0(n log n) for OneMax function. The A 
term in the denominator means that for the algorithm run on parallel 
computers the increase in the recombination pool size improves the 
performance. 

D. Comparison to earlier results 

The closest comparison we can draw is to (A'' + A'^)EA with mutation 
and tournament selection function in 1101 . 0{nN log N + nlogn) 
if measured in the number of function evaluations (Proposition 4). 
By setting A'' = 0(1) — c > 1 this bound becomes nlogn + 
0{n), which is larger than just 0(n log n). If instead we set /i — 
N = 0(Vlog n) or 0( io'g°fog„ ) the result in flOl is sharper than in 
this paper. For populations Q{-^^^) though the bound in this article 

og n ^ 

becomes sharper again, e.g., for p — N — 0{^/n) it is cn^ logn — 
ni, and in |10| it is "^^g" +0(nlogn). 

VI. Discussion 

We presented a new tool to analyze population-based elitist EAs, 
Elitism Levels Traverse Mechanism, which we used to derive a new 
upper bound on {fi + A) EAs with a recombination operator and a 
variant of tournament selection solving OneMax problem. 

We derived and proved the lower bound on the probability of 
evolving exactly 1 new currently-elite species, which helped us 
obtain the upper bound on the expected optimization time. 

We showed that for a function with fitness plateaus it is harder to 
add a super-elite offspring to the population than an elite offspring 



for a function without plateaus. This means that the very number of 
super-elite species in the population is more important in the former 
case than the number of elite species in the latter. 

It may seem from the derived equations that population generally 
degrades performance (since fi is in the numerator), but for small 
size of population, when the cost of functions evaluations is not 
much different from 1, population brings about some positive effect. 

As it keeps increasing, the effect levels out, at the same time 
the costs of evaluating functions grows and population loses its 
benefit. For other algorithms, s.a. RLS the effect even of small-sized 
population is usually negative, which makes EA+IBS (and, possibly, 
other recombination-based algorithms) stand out. 

At the same time the recombination pool improves performance 
(at least when measured in terms of the number of generations), 
since A is in the denominator. This means there is a benefit from 
increasing recombination pool size when the algorithm is run on 
parallel computers. 

The Mechanism we have designed in this article proved to be 
quite efficient in deriving upper bounds for OneMax function and 
we are confident it can also yield tight upper bounds on other 
population-driven algorithms and more complicated problems. 

VII. Conclusions and Future Work 

There are many reasons to use population in evolutionary computing 
rather than just (1 + 1) or (1, 1) algorithms, that includes higher di- 
versity and shorter evolutionary path (see 1 10 1). We intend to expand 
the results in this article by considering the following extensions to 
the upper bound tool: 

1) Analysis of functions with fitness plateaus. Apparently for 
functions with fitness plateaus (e.g. Royal roads) both large 
populations and large number of elite parents are crucial 
compared to functions without one, so we will extend our 
findings to these functions as well. 

2) Typical runtime analysis. It is fairly obvious that the actual 
number of elite species grows every generation at some rate 
that realistically lies between the upper and lower bounds. We 
need to find an approximation on the expected number of a 
added to the population every generation and thus estimate the 
typical runtime. 

3) Elitism rates analysis. In this article we never really considered 
the rate of elitism, i.e. the actual number of species saved in the 
population each generation, although numerical computation 
shows that it has a strong effect on the runtime. So far we 
only said that all the elite species are saved each generation, 
thus accumulating over time till 5fi. It would be interesting to 
compare elitism level 1 to 50%, i.e. if there is any difference 
if only 1 species is saved compared to half of the population. 

4) Derivation of 5fj, to find the proportion of elite species that 
yields a high enough probability of evolution. Quite obviously 
it is different for functions with plateaus and without. 

5) Derivation of the optimal population size. We will do this by 
comparing the number of functions evaluations necessary of 
(1 + 1) and {fj, + A) algorithms. 
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p{E*) = p( y 



"J /a 



= E ( ^ ) ^-'(1 - ^-o^"'' ( [ 1 vAm - vAk)y 



In this expression Pswap is not necessarily the probabihty to swap 
bits < 0, 1 >. It is the probability to swap bits such that an elite 
offspring evolves. Since all the terms in the sum are positive, we use 
the lower bound on this expression: 

/a 

r 



(l-Psel)^ 



(l-Psel)' 

Canceling out <fij{k) and moving the term e^^ < (1 — Psei)^ < 
< 1 on the other side, LHS of the inequality becomes 

''a / \' 

r 

> {P,el{l~nik)) + 1-Psel)'^ 
= (1-P,e,^,(fc))* 

and the RHS is upper-bounded by 

1_ ^ ^Psci _ 1_ ^ o(\'^~^)hy the argument below 

LHS is lower-bounded by (using Bernoulli inequality for ^ > 1): 

A ^ XPsel^Piik) 



m*)>Eu ]PUl^^Ak)ni'Psel)^^ 



P{E*) > {1-Psem{k))^ > 1 



Appendix A 
Proof of Equations[T][2] 



Main idea and logic of the lower bound on the probability of adding 
an elite offspring and the upper bound on runtime following from 
this is presented in Section |IV| Here we present the derivation of 
this bound. 

We prove this lower bound inequality for an arbitrary subset 
(it is not to be confused with at trivial one of the form 



Since we can select Psei — o{X'^) and fj{k) — 0{:^),c £ Z, the 
expression is 

P{E;) = 1 - o{l) > ^ + o{l) ^ P{E,) (12) 

thus proving the upper bound on the probability of evolving 1 more 
elite species for an arbitrary subset. This logic applies for each of 
the M subsets (types of pairs) of the recombination pool, and the 
inequality becomes 

/ M \ M 

^ U^O>E^(^^) ^13) 
\j=i I j=i 

The upper bound in Equation [2] follows directly. 



Appendix B 
Solution of Equation[5] 

We have a quadratic equation 



with 



60 = 



if2{k) - 2fj,tpi{k) 

In order to simplify the already complicated derivation, we want the 
expression above in the form 

Pr,(a) = i = i 

for some r, not necessarily rational. From equating coefficients it 



becomes clear that 



r = Vbn W r = — 
2 



and so, using the first root 



^ (Q + r)2 

a — 1 ^ ^ 



= V-i (V &o) - ^/-i (V &o +5/1 + 1) 



For large feo these expressions involving digamma function can be 
expanded asymptotically in Taylor series (we use only the first two 
terms): 



S{fi,k) 



bo^ 2bo) ^\ bo ^ bo ^ 2bo ) ^ bo 
Sfj,{ip2{k) - 2^upi{k)) _ 5{(p2{k) - 2^ipx{k)) 



and therefore the expected time to traverse enough levels of elitism to 
improve 1 bit of the string is (plugging this expression into Equation 



2fi^ 



A(<^2(fc)-2^(^i(fc)) 
2n^5 



5{ip2ik) ^ 2fiipi{k)) 
fiip2{k) 



Xip2{k) 

To improve the pair < /3i , /3i > we need to either swap 1 from the 
first parent and from the second, or the other way around (any other 
outcome just keeps the current number of bits in each parent): 

,,, „ fc-1 n-k+1 2(fc-l)(n-fc + l) 

V52(fc = 2 ■ = 

n n n-' 

Plugging this into the expression for ETfc, we obtain the expected 
optimization time of the algorithm, pessimistically assuming that at 
the beginning of the run the best species has only 2 1-bit and finishes 
at n — 2, since if the fitness of /?i — n — 1 implies the fitness of 
a = n. 



Q 9 ~ n — 2 



^ frf, (k - l){n - k + 1) 



A 



1 / - - 1 - - 1 

n\^k-l ^n-k + 1 
\ fc=2 ; - 

log(n-l) + 0(l) 



The second step is due to partial fraction expansion. Although this 
seems quite a loose bound given cubic in ^, we take ^ — 0(A) so 
all we need to establish is 5 to reduce the power. 

Obviously < 5 < 1, but we need to select it s.t. summation 
over a makes sense. We set 5 — fJ,~^^ for an arbitrary 



El > s.t. Sfl 



> 1. Then S^^ 



1 + 52 



For example, £ = ^ yields Sjj, = ^/Jl and S^^ = \//?. Therefore, 
the upper bound on the expected convergence time is 



(14) 



In fact if (similar to |T0|) we set 5 = ^ for c G Z+, we get Sjj. = 
c and 5fj? = cfi = 0{fi), so the expectation becomes linear in fj,: 

E^^i+\)EAlBs ^ Oifinlogn) (15) 



Appendix C 
Solution to Equation[9] 

We need a solution to the cubic equation of the form 

a. <y. — l ^ 



where 



b'2 
b[ 

b'o 



V>2{k) — 4:ii^(fi3{k) — 2fj,ipi{k) — Qfi(p4{k) 

2{fi^s{k) - fi{k)) 
fi{fi(pi{k) + fi'^ipslk) — 3ii(p4{k) — y2(fc)) 

2{^l(p3{k) - ly34(fc)) 

fj.'^ {2^ip4,{k) + f2{k)) 

2(/i</?3(fc) - f4,{k)) 



Solution to S{iJ.,k) is of the form 
1 



1 



(a + p)^ ^-^ + 3a2p + Sqo^ + 

a a 

Equating the coefficients we obtain three roots p: 



^=3 

p = ± 



To simplify the increasingly hard notation, we select only the last 
root: 



6(1 

s{f^,k) = Yl 



+ 



M^/b[^ + 5^l + l)-MWo + ^) 



2(S/i + 1 



1 

2 

1 2&p, 5fi 



The second line in the derivation was obtained by expanding both 
second-order polygamma functions in Taylor series as b'o — > 00 and 
taking two first terms of each function. We now combine the front 
term in Equation |9] with this derivation to obtain the expression on 



the upper bound on achieving the number of elite species in the 
population S/i: 



/J,, fc 



2fj,^5 _ 



= /l^ (2^1/33 (fc) + ipA{k)) 

We are now ready to find the upper bound on the expected optimiza- 
tion time of the algorithm: 

(16) 

Here again we pessimistically assume that the best species at the start 
of the run has fitness 3, since in such case fitness of has minimal 
fitness of 1, otherwise we obtain inconsistencies s.a. i. We have two 
probabilities to consider for the two new types of pairs: 

,,, k k — 2 n — k n — k + 2 

< a, /3_i >: if3{k) = + 

n n n n 

_ k(k - 2) + (n - k){n - k + 2) 

We need to preserve the better parent in order to get it added to the 
population, so need to either select 1-bits in each parent or 0-bits in 
each parent, for the last swap probability, ipi(k), we need only to 
select a in the /3i parent and 1 in parent, other options either 
degrade the better parent or leave the current fitness. 

o o /,N (n-fc + l)(fc-2) 

We continue with manipulating with the summand over k: 

S{n,k)= ^ 



2/i(^3(fc) + (fi4{k) 



(4/i - l)fc2 + (n - - 4)111 + 3)fc + 4)171 - 2n + 2)irfi - 2 

'2 1 



II k^ - 4(n + 2)k + 2n{n + 1) 



We leave out the first fraction, and factor the denominator in the form 
{k — s){k — r), s.t. s,r are solutions to the set of equations: 



s + r = 4(n + 2) 
sr = 2n{n + 1) 



The resulting solution (we only use one of the roots, which are 
symmetric) is: 



s =2n + V2Vn'^ + 7n + 8 + 4 
r =2n- V2Vn'^ + 7n + 8 + 4 

The value under the root can be bounded by 

n + 2 < + 7n + 8< n + 4 

So the expression becomes upper-bounded by 

1 

(fe - {2n + V2{n + 2))){k - (2n - V2{n + 4))) 



Expanding this in partial fractions, we obtain 
1 / 1 



2V2{n + 3) \k-{2n + V2{n + 2)) 
1 

A; - (2n - ^(n + 2)) 



We obtain two sums over k: 

n — 3 

Si(n,k) = J2 



k- {2n + \/2(n + 2)) 

« i/'o(n — 2n — V2n) — ?/'o(3 — 2n — \/2?i) 

= Vo(-(l - V2)n) - ^Ao(3 - (2 + V2)n) 

= 0(1)-0(1) = -0(1) 

The result of — 0(1) is due to the fact that we can select any n, for 
which the values of digamma function are small negative constants 
(see Appendix |D] for details on Taylor series expansion of 'ipo{n) for 
n < 0). For the second sum, we notice the upper bound on the value 
in the denominator, since 2- V2 ^ 0.58 < 1: 

n — 3 - 

S2(n, fc) = y ^ 

fr"3 fc - (2n - V2(n + 2)) 

n—3 ^ n—3 ^ 

fc=3 fc=3 

the minus sign in front of the expression cancels out and we obtain 
the upper bound for S{ii,n): 

n2(log(n-3)-0(l) 



S(M,n) < 



fin 



and the upper bound on the expected first hitting time: 

25fi'^n{\og{n - 3) - 0(1)) 



A 



(17) 



with S = j-^ the expression becomes (measured in the number of 
generations, for c > 0) 



Er, 



{M + A)BAibs 



cfm log n jin 



A \ A 

or, in the number of function evaluations, 

Er(^+A)£;AiBs = cfinlogn- 0{fin) 



(18) 



(19) 



Appendix D 
Mathematical Expressions 

There is a number of important mathematical expression used 
throughout the article, we present some of them here: 



H{n) = J2 



""'l dx 



n — k ^-^ k 
Digamma function: 



log n < log n + 1 



ipo (n) = log n + O - 



For tpoin) with n — >■ — oo we use the largest term of Taylor series 
for asymptotic expansion: 

i^o(n) R:: ncot(nn) + o( — 
\ n 



The values for cot(7rn) for integer n, such as in this article, are 
infinity. Therefore for expressions for Si{n, k) and ^(n, k) we have 
selected some constants, e.g. —(1 — v^),2 — \/2, s.t. the resulting 
values are constants. Since n is arbitrarily large, we can find such n 
that the difference between them is negative, hence we obtain term 
-0(1). 



